Skip to content

Week 5 community contribution (profe-ssor): Research/Learning RAG#2068

Open
profe-ssor wants to merge 1 commit intoed-donner:mainfrom
profe-ssor:profe-ssor-week5-contributions
Open

Week 5 community contribution (profe-ssor): Research/Learning RAG#2068
profe-ssor wants to merge 1 commit intoed-donner:mainfrom
profe-ssor:profe-ssor-week5-contributions

Conversation

@profe-ssor
Copy link
Contributor

Week 5 Project: Research / Learning RAG Assistant
Summary
A RAG (Retrieval Augmented Generation) Q&A assistant for course notes and research-style documents. It answers questions using only the content you put in a knowledge base (no general knowledge), so it stays on-topic and avoidable.
What it does
Ingest: Loads markdown documents from a folder (default: week5 knowledge-base; can point to your own course or research notes).
Chunk & embed: Splits docs with RecursiveCharacterTextSplitter (500 chars, 100 overlap) and embeds with HuggingFace all-MiniLM-L6-v2 (no API key).
Store: Saves embeddings in Chroma (persistent vector DB).
Answer: For each question, retrieves the top‑k chunks, builds a context string, and calls an LLM with a system prompt: “Answer only from the context; if you don’t know, say so.”
UI: Gradio chat interface so you can ask questions in a browser.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant